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Vh I Abstract 

•^ i One of the most natural optimization problems is the fc-SET Packing problem, where given 

a family of sets of size at most k one should select a maximum size subfamily of pairwise 

disjoint sets. A special case of 3-Set Packing is the well known 3-Dimensional Matching 

problem, which is a maximum hypermatching problem in 3- uniform tripartite hypergraphs. Both 

C/j . problems belong to the Karp's list of 21 NP-complete problems. The best known polynomial 

^^ I time approximation ratio for A:-Set Packing is (fc + e)/2 and goes back to the work of Hurkens 

^ . and Schrijver [SIDMA'89], which gives (1.5 + e)-approximation for 3-DlMENSlONAL Matching. 

O I Those results are obtained by a simple local search algorithm, that uses constant size swaps. 

Halldorsson [SODA'95] has shown that logarithmic size swaps lead to an improved approx- 
imation ratio, however at the cost of quasipolynomial time complexity. Therefore a natural 
^ . question is whether it is possible to search the space of r-size swaps in c''poly(|J^|) time for 

'^ ' constant c and k. We show that this is most likely impossible, i.e. there is no such algorithm 

^ 1^ . with /(r)poly(|J^|) running time, unless W[1]=FPT, where / is some computable function, even 

for fc = 3. Therefore trying to find a c''poly(|J^|) time algorithm which searches the whole r-size 
swaps space is not the proper path. 

The main result of the paper is a new approach to local search for fc-SET Packing where 
^r^ . only a special type of swaps is considered, which we call swaps of bounded pathwidth. We show 

that for a fixed value of k one can search the space of r-size swaps of constant pathwidth in 
c'"poly(|J-'|) time. Moreover we present an analysis proving that a local search maximum with 
respect to 0(log | J"|)-size swaps of constant pathwidth yields a polynomial time (fc + 1 + e)/3- 
^\ , approximation algorithm, improving the best known approximation ratio for fc-SET Packing. 

In particular we improve the approximation ratio for 3-Dimensional Matching from 3/2 + e 
to 4/3 + e. 
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1 Introduction 

In the Set Packing problem, also known as Hypergraph Matching, we are given a family 
J-" C 2^ of subsets of U, and the goal is to find a maximum size subfamily of T of pairwise 
disjoint sets. Set Packing is a fundamental problem in combinatorial optimization with various 
applications. A simple reduction from Independent Set (where \J^\ = \V\) combined with the 
hardness result of Hastad [13j makes the Set Packing problem hard to approximate. When each 
set of Set Packing is of size at most k the problem is denoted as A;-Set Packing. 
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of sets of size at most k. 
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disjoint 


sets. 



k-SET Packing is a generalization of Independent Set in bounded degree graphs, as well 
as A;-DiMENSiONAL Matching and is related to plethora of other problems (see [6j for a list 
of connections between A;-Set Packing and other combinatorial optimization problems). In 3- 
DlMENSiONAL MATCHING the universe U is partitioned into U = X ^Y [^ Z and J" is a subset of 
X xY X Z. 

Both 3-DiMENSiONAL MATCHING and Set Packing are weh studied problems, belonging to 
Karp's list of 21 NP-hard problems [1^. A simple greedy algorithm returning any inclusionwise 
maximal subfamily of disjoint subsets of J- gives /c-approximation for A;-Set Packing. One can 
consider a local search routine, where as long as it is possible we remove one set from our current 
feasible solution and add two new sets. We say that such an algorithm uses size 2 swaps, as two 
new sets are involved. It is known that a local search maximum with respect to size 2 swaps is a 
{k + l)/2-approximation for A;-Set PACKING. If, instead of using swaps of size 2 we use swaps of 
size r for bigger values of r, then the approximation ratio approaches k/2, and that is exactly the 
{k/2 + e)-approximation algorithm by Hurkens and Schrijver [16]. 

Despite significant interest (see Section ri.2p for over 20 years no improved polynomial time ap- 
proximation algorithm was obtained for A;-Set Packing, even for the special case of 3-Dimensional 
Matching. Meanwhile Halldorsson |12] has shown that a local search maximum with respect to 
0(log |J^|) size swaps gives a (/c + 2)/3-approximation, which was recently improved to (A; + 1 + 
e)/3 ^. Nevertheless enumerating all ©(log \T\) size swaps takes quasipolynomial time. 

1.1 Our results and techniques 

Based on the work of Halldorsson [12J a natural path to transforming a quasipolynomial time 
approximation into polynomial time approximation would be by designing a c''poly(|J-'|) time algo- 
rithm, where c is a constant. This is exactly the framework of parameterized complexitju, where 
the swap size is a natural parameter. Unfortunately, we show that this is most likely impossible, 
i.e. there is no such algorithm with /(r)poly(|J-'|) running time, unless W[1]=FPT, where / is some 
computable function, even for A; = 3. We would like to note that W[1]/FPT is a widely believed 
assumption, in particular if W[1]=FPT, then the Exponential Time Hypothesis of [17] fails. 

Theorem 1.1. Unless FPT = W[l], there is no /(r)poly(|J^|) time algorithm, that given a family 
J-" C 2^ o/ sets of size 3 and its disjoint subfamily J-q '^ J- either finds a bigger disjoint family 
T\ ^ T or verifies that there is no disjoint family J-i ^ J- such that \J^q \Fi\ + \Ti \J'q\ < r, 

Therefore trying to find a c'polydJ-'l) time algorithm which searches the whole r-size swaps 
space is not the proper path. For this reason we introduce a notion of swaps (also called improving 



^For further information about parameterized complexity we defer the reader to monographs [9l fm[23 



sets) of bounded pathwidth (see Section [3TT]) . Intuitively a size r swap is of bounded pathwidth, if 
the bipartite graph where vertices represent sets that are added and removed, and edges correspond 
to non-empty intersections, is of constant pathwidth. Using the color-coding technique of Alon et 
al. [T] we show that one can search the space of swaps of size at most r of bounded pathwidth in 
c^polydJ-"!) time, for a constant c. As the currently best known analysis of local search maximum 
with respect to logarithmic size swaps of [8] relies also on swaps of unbounded pathwidth, we 
need to develop a different proof strategy, and the core part of it is contained in Lemma 13.71 
The algorithm and its analysis complete the main result of this paper, that is a polynomial time 
(/c + 1 + e)/3-approximation algorithm, for any fixed k and e. 

Theorem 1.2. For any e > and any integer k > 3 there is a polynomial time (A; + 1 + e)/3- 
approximation algorithm for k-SET PACKING. 

We believe that the usage of parameterized tools such as color-coding, pathwidth and W[l]- 
hardness in the setting of this work is interesting on its own, as to the best of our knowledge such 
tools have not been previously used in approximation local search algorithms. 

1.2 Related work 

Even though there was no improvement in terms of polynomial time approximation of /c-Set 
Packing (and 3-Dimensional Matching) since the work of Hurkens and Schrijver [16], both 
problems are well studied. 

One can also consider weighted variant of A;-Set Packing, where we want to select a max- 
imum weight disjoint subfamily of J^. Arkin and Hassin [2] gave a (A: — 1 + e)-approximation 
algorithm, Chandra and Halldorsson [7J improved it to {2k + 2 + e)/3-approximation, later im- 
proved by Berman [4J to {k + l + e)/2-approximation. All the mentioned results are based on local 
search. 

Also for the standard (unweighted) A;- Set Packing problem Chan and Lau [6j presented a 
strengthened LP relaxation, which has integrality gap {k + l)/2. 

On the other hand, Hazan et al [13] have shown that /c-Set Packing is hard to approximate 
within a factor of 0{k/ log k). Concerning small values of k, Berman and Karpinski [5] obtained 
a 98/97 — e hardness for 3-Dimensional Matching, while Hazan et al. [T5] obtained 54/53 — e, 
30/29 — e, and 23/22 — e hardness for 4, 5 and 6-Dimensional Matching respectively (note that 
a hardness result for A;-Dimensional Matching directly gives a hardness for A;-Set Packing). 

Recently Sviridenko and Ward [25] have independently obtained a (A: -|- 2)/3-approximation 
algorithm for A:-Set Packing. They observed that the analysis of Halldorsson [12] can be com- 
bined with a clever application of the color coding technique. However to the best of our under- 
standing it is not possible to obtain (A: + 1 + e)/3-approximation for A;-Set Packing using the 
tools of [25], and in particular Sviridenko and Ward do not improve the approximation ratio for 
3-DlMENSIONAL MATCHING. 

1.3 Organisation 

We start with preliminaries in Section [21 where we recall standard graph notation together with 
the definition of pathwidth and path decompositions. 

Section [3] contains the main result of the paper, that is the (A; + 1 + e)-approximation for A;-Set 
Packing. First, we introduce the notion of improving set of bounded pathwidth in Section [3. 11 In 
Section 13.21 we apply the color coding technique to obtain a polynomial time algorithm searching 
an improving set of logarithmic size of bounded pathwidth. In Section [3]3] we analyse a local search 



maximum with respect to bounded pathwidth improving sets of logarithmic size. The heart of our 
analysis is contained in an abstract combinatorial Lemma 13.71 which is later applied in the proof of 
Lemma 13.91 

The proof of Theorem 11.11 is given in Section HI Finally, in Section [5] we conclude with potential 
future research directions. 

2 Preliminaries 

We use standard graph notation. For an undirected graph G by V{G) and E{G) we denote the set of 
its vertices and edges respectively. By Ng{v) = {u : uv € E{G)} we denote the open neighborhood 
of a vertex v, while the closed neighborhood is defined as Nq[v] = Ng{v) U {v}. Similarly, for a 
subset of vertices X we have Ng[X] = IJ^gx ^g[v] and NaiX) = Ng[X] \ X. 

By a disjoint family of sets we denote a family, where each pair of sets is pairwise disjoint. For 
a positive integer r by [r] we denote the set {1, . . . , r}. 

Pathwidth and path decompositions A path decomposition of a graph G = {V, E) is a 
sequence P = {Bi)1^^, where each set Bi is a subset o vertices Bi C V (called a bag) such 
that IJi<i<o Bi = V and the following properties hold: 

(i) For each edge uv € E{G) there is a bag Bi in P such that u,v & Bi. 

(ii) 11 V (z Bi D Bj then v £ B^ for each min(i, j) < i < max(z, j). 

The width of P is the size of the largest bag minus one, and the pathwidth of a graph G is 
the minimum width over all possible path decompositions of G. Since our focus here is on path 
decompositions we only mention in passing that the related notion of treewidth can be defined 
similarly, except for letting the nodes of the decomposition form a tree instead of a path. 

In order to make the description easier to follow, it is common to use path decompositions that 
adhere to some simplifying properties. The most commonly used notion is that of a nice path 
decompositions, introduced by Kloks [19]; the main idea is that adjacent nodes can be assumed to 
have bags differing by at most one vertex. 

Definition 2.1 (nice path decomposition). A nice path decomposition is a path decomposition 
P = {Bi)j^^, where each bag is of one of the following types: 

• First (leftmost) bag: the bag Bi is empty, Bi = 0. 

• Introduce bag: an internal bag Bi of P with predecessor -Bj-i such that Bi = i?,;-i U {v} 
for some v ^ Bj-i. This bag is said to introduce v. 

• Forget bag: an internal bag Bi of P with predecessor i?i-i for which Bi = -Bj-i \ {v} for 
some V G -Bj_i. This bag is said to forget v. 

• Last (rightmost) bag: the bag associated with the biggest index, i.e. q, is empty, Bq = 0. 

It is easy to verify that any given path decomposition of width pw can be transformed in 
time |y(G)|pw'-^'^'' into a nice path decomposition without increasing the width. 



3 Local search algorithm 

In this section we present the main result of the paper, i.e. the (A; + 1 + e)-approxiniation algorithm 
for A;-Set Packing, proving Theorem 11.21 We start with introducing the notion of improving set 
of bounded pathwidth in Section 13.11 Next, in Section 13.21 we apply the color coding technique 
to obtain a polynomial time algorithm searching an improving set of logarithmic size of bounded 
pathwidth. In Section 13.31 we analyse a local search maximum with respect to bounded path- 
width improving sets of logarithmic size. The heart of our analysis is contained in an abstract 
combinatorial Lemma 13.71 which is later applied in the proof of Lemma 13.91 

3.1 Bounded pathwidth improving set 

Let us assume that an instance J-" C 2^ of A;-Set Packing is given. Moreover by J-q C J^ we 
denote some disjoint subfamily of T ^ which we can think of as a current feasible solution of a 
local search algorithm. In what follows we define a conflict graph, which is a bipartite undirected 
graph with two independent sets of vertices being J-q and T\J-o, where an edge reflects non-empty 
intersection. 

Definition 3.1 (conflict graph). For a disjoint family J^q C J^ by a conflict graph Gjrg we 
denote an undirected bipartite graph with vertex set T and edge set {S1S2 : ^i G J^o,S2 G 

Next, we define an improving set X Q T\Tq, which can be used to increase the cardinality of 
J-Q , and then we introduce a notion of an improving set of bounded pathwidth, which will be crucial 
in both the algorithm and the analysis of its approximation ratio. 

Definition 3.2 (improving set). For a disjoint family J-q C J^ a set X C J^ \ J^q is called an 
improving set, if the following conditions hold: 

• all sets of X are pairwise disjoint, 

• {Ngjt iX)\ < \X\, i.e. the number of sets of J-q having a common element with at least one 
set of X is strictly smaller than \X\. 

Observe, that if we have an improving set X, then {Tq \ Ngjt (X)) U X is a disjoint subfamily 
of J^ of size greater than \J-o\, hence the name improving set. 

Definition 3.3 (improving set of bounded path^vidth). An improving set X with respect to 
J-Q C- J- has pathwidth at most pw, if the subgraph of the conflict graph Gjfq induced by Nqj, [X] 
has pathwidth at most pw. 

3.2 Algorithm 

To find an improving set of bounded pathwidth we use the color coding technique of Alon et al. [1], 
which is by now a well-established tool in the parameterized complexity used for finding a set 
consisting of disjoint objects. We use two random colorings cj^^ : /"o — )• [r — 1], c;/ : C/ — >• [rk], 
where cjj ensures that the sets of X are disjoint, while cj-g is used not to consider the same set of 
Tq twice. Due to space constraints we postpone the proof of the following lemma to Appendix Rl 

Lemma 3.4. There is an algorithm, that given a disjoint family Fq C T, and two coloring functions 
cto '■ J~o ~^ k ~ 1]> ct/ : U ^ [rk] in 2^(''"'^)|J^|'-'(p") time determines, whether there exists an 
improving set X (^ T\J^q of size at most r of pathwidth at most pw, such that cj^^ is injective on 
Ngjt {X) and cjj is injective on (Jsgx ^- 



Theorem 3.5. There is an algorithm, that given a disjoint family Tq C T, in 2*-^''''^)|7^|'-^(p") time 
determines, whether there exists an improving set X Q J^ \To of size at most r of pathwidth at 
most pw. 

Proof. Observe, that if we take cj-q : J-q — >■ [r — 1] where the color of each set is chosen uniformly 
and independently at random, then for an improving set X of size at most r the function cjr^ is 
injective on A^Gjf (^) with probability at least 

(r - l)!/(r - 1)^-1 > ((r - l)/ef-V(r - If"^ = e"^"-^^ . 

Similarly, if we assign a color of [rk] to each element of U, then with probability at least e~'^^ 
the function cjj '■ U ^ [rk] is injective on IJsex '^- Therefore invoking Lemma 13.41 with random 
colorings cj^,cu at least e''~^+'"^ times would yield a constant error probability. 

To obtain a deterministic algorithm we can use the, by now standard, technique of splitters. 
An (n, a, 6)-splitter is a family H of functions [n] — t- [6], such that for any W C [n] of size at most 
o there exists f (z H that is injective on W. What we need is a small family of (n, a, a)-splitters. 

Theorem 3.6 ( [22]). There exists an {n, a, a) -splitter of size e"'a'~^^^°^"'' logn that can be constructed 
in 0{e"'a'^^^°^°'^n\ogn) time. 

Therefore instead of using random colorings cjr^^ , cjj we can use Theorem 13.61 to construct 
(|J-o|, r — 1, r — 1) and {\U\,rk, rk) splitters, leading to a deterministic algorithm, which finishes the 
proof of Theorem 13.51 D 

3.3 Analysis 

In this subsection we analyze a local search maximum, with respect to logarithmic size improving 
sets of constant pathwidth. It is well known that an undirected graph of average degree at least 2+e 
contains a cycle of length at most c^ log n, where the constant c^ depends on e. This observation 
was a base for the quasipolynomial time algorithms of [8l[12]. Here, however we need to generalize 
this result extensively, as the analysis of [8] relies on improving sets of unbounded pathwidth. 

Throughout this subsection we operate on multigraphs, as there might be several parallel edges 
in a graph, however there will be no self-loops. 

Lemma 3.7. Let H = {V,E) be an n-vertex undirected multigraph of minimum degree at least 3, 
where each edge e G E is associated with a subset of an alphabet We ^^ of size at most 7, where 
7 > 2. // each element c S S appears in at most 7 sets Wg, i.e. Vcgs \{e : e a E , c G We}\ < 7, then 
there exists a tree Tq = {Vo,Eq), which is a subgraph of H , and a vertex tq € Vq, such that: 

• \Vo\ <4(log3/2n + 2); 

• there exist two edges ei, 62 G E\Eo,ei 7^ 62 which have both endpoints in Vq; 

• Tq is a tree with at most 4 leaves; 

• for each pair of edges ei,e2 G £"0 such that We-^r\We2 7^ we have |distT'o(ro, ei)— distro('^O) 62)] < 
(3, where (3 = [log3/2(67^)l, and <X\stTo{ro,uv) = min(distTo(ro,n),distTo(ro, v)). 

Proof. Pick any r gV. We are going to construct a sequence of trees Tq, Ti, . . . rooted at r, which 
are subgraphs of H satisfying two invariants: 

• (exponential growth) for any < j < i the number of vertices in Tj at distance exactly j 
from r is exactly [(3/2)-' J, 



• (S-nearness) for any two edges ei, 62 of Tj [(we^riWe^ / 0, then |distT,(r, ei) — distr, (r, 62)] < 
p. 

Note that as Tq we can take a tree with a single vertex r. Let us assume, that Tj was the most 
recently constructed tree, and we want to construct Tj+i. Let V be the vertices of Tj at distance 
exactly i from the root r. By the exponential growth invariant, we have \V'\ = [(3/2)* J. Let 
E' Q E he the set of edges of H incident to V, but not contained in Tj. As each vertex in H is of 
degree at least three, we have 

l^'l > 2\V'\ > 2(3/2)* -2. 

If at least two edges of E' have both endpoints in Tj, denote those edges uv,u'v' € E', then as 
Hq we take the subtree of Tj induced by vertices on the paths between {u, v, u' , v'} and their least 
common ancestor rg. 

Therefore let E" C E' be the subset of edges having exactly one endpoint in Tj (that is in V'), 
\E"\ > 2(3/2)* - 3. Let 

Ebanned = {e & E : 3e,^E{T,_p)We H We' 7^ 0} , 

i.e. the set of edges having a non-empty intersection with We', where e' is not contained in the 
last (3 levels of Tj. As we want to maintain the S-nearness invariant, we want to use only edges of 
E" \ Ehanned- For this reason we upperbound the size of Ebanned as follows 

\Ebanned\ < ^l>j\ < l'^li3/2y\ < 2^^(3/2)^-^+' - 1) < 3^^(3/2^^ - 1) < ^^ - 37' 

i=0 i=0 

The first inequality follows from the assumption, that each set We is of size at most 7 and each 
element of S is contained in at most 7 sets We, whereas the last inequality follows from the choice 
of 13. Consequently 

\E" \ Ehannedl > \E"\ - \Ebanned\ > 2(3/2)* - 3 - (^^^ - 3jA > (3/2)*+^ + 1 . 

Let V" be the set of endpoints of edges of E'" = E" \ Efjanned not in Vi. Observe, that if 
\V"'\ < \E"'\ - 1, then either: 

• there exists three edges ei, 62, 63 G E'" having a common endpoint in y \ V{Ti), 

• there exist four edges ei, 62, 63, 64 G E'", such that ei, 62 have a common endpoint in V\V(Ti) 
and 63, 64 have a common endpoint in y \ V{Ti). 

In both cases we can extend the tree Tj be one or two edges to prove the theorem. 

Finally, we may assume that \V"'\ > \E"'\ — 1 > (3/2)*^"*^, therefore we can extend the tree 
Tj to a tree Tj+i, maintaining both the exponential growth and the S-nearness invariant. As the 
trees are growing exponentially, after a logarithmic number of iterations we will find the promised 
subgraph Hq. D 

Corollary 3.8. Let H = {V,E) be an undirected multigraph with n vertices and of minimum degree 
at least 3, where each edge e gV is associated with a subset of an alphabet Wg QTi of size at most 
7, for some 7 > 2, such that each element of S appears in at most 7 sets We- There exists a 
subgraph Hq = {Vq, Eq) of H , and a path decomposition (Bi)'^^-^ of Hq of width at most 4(/3 + 2), 
where j3 = |'log3/2(67^)] and 
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. lSo| = |^o| + l, 

• 1^0! <4(log3/2n + 2), 

• for each pair of edges 61,62 G Eq, such that We-^ n We2 7^ there exists a bag Bi for some 
1 < "^ < 9, such that all of the endpoints of both 61 and 62 are contained in Bi, 

• for each edge uv € Eq the set of indices {i : u,v (z Bi} is an interval. 

Proof. First, we use Lemma [3.71 to obtain Tq = {Vq^Eq), tq G Vq, such that |Vo| < 4(log3/2n + 
2), where for each pair of edges 61,62 G Eq such that Wei n We2 / we have |distTQ(ro, 61) — 
distTo(?"0)e2)| < /?. Let 61,62 £ E \ Eq he two edges with both endpoints in Vq. Define Hq = 
(VqjEq U {61,62}), clearly Hq is a subgraph H and it has more edges than vertices. Therefore it 
remains to show that there exists a path decomposition of Hq of width at most 4(/? + 2), satisfying 
the last two properties required by the corollary statement. 

Let Di be the set of vertices of Vq at distance exactly i from tq in Tq. Consider a sequence 
(-Bj)^^Q, where q = 4(log3/2 n + 2), and Bi = Umax(o,j-/3)<i<j A U 61 U 62. It is straightforward to 
check that this is in fact a path decomposition of Hq, and since Tq has at most 4 leaves, this implies 
that the size of each Di is upper bounded by 4, and hence the path decomposition is of width at 
most 4(/3 + 2). Finally, observe that the third property required by the corollary follows from the 
last property of Lemma 13.71 while the last (fourth) property required by the corollary is true for 
any path decomposition, as the intersection of any two intervals is an interval. D 

Lemma 3.9. Fix arbitrary e > 0. There are constants 61,62 (depending on k and e), such that for 
any disjoint family Fq C T, for which there is no improving set of size at most ei log re of pathwidth 
at most 62 we have \OPT\ < (e + {k + l)/3)|J-o|, where OPT Q J^ is a maximum size disjoint 
subfamily of J^. 

Proof Let C = Jq n OPT and denote Aq = I'qXC, Bq = OPT \C. Let Go be the subgraph of 
Gjr^ induced by AqL) Bq. We are going to construct a sequence of at most 1/e subgraphs of Go, 
namely Gj = Go[^i U Bi] for i > 1, where Ai C Aq, Bi C Bq, such that (*) in Gj there is no subset 
X Q Bi of size at most 2{k + 1)^/'"*, such that \NGiiX)\ < \X\. Observe that Gq satisfies (*) by 
the assumption that there is no improving set of size at most 2(fc + 1)^'^ < min(6i,e2). Consider 
subsequent values of i starting from 0. Split the vertices of Bi into groups Bj,Bf,Bf, consisting 
of vertices of Bi of degree exactly one, exactly two and at least three in Gj, respectively. Observe 
that because of (*) there is no isolated vertex of Bi in Gj and moreover no two vertices of Bf have 
a common neighbour in Gj. Consider the following two cases: 

• \Bf\ > e\OPT\: in this case we construct the graph Gj+i = Go[^j+i U Sj+i], where ^i+i = 
^i \ ^GiiBj) and i?j+i = Bf U Bf = Bi \ Bj. The invariant (*) is satisfied, as any set 
X C i?j_|_i of size at most 2{k + 1)^/'^"*"^ such that |A^g'-^-^(X)| < \X\ would imply existence 
of a set X' = X U {Ng{Ng{X)) n BJ) of size at most (k + l)- \X\ < 2{k + 1)^/'"*, such that 
\NgAX')\<\X'\. 

• \Bj\ < e\OPT\: We are going to show 

\Bf\<{l + e)\A,\. 

Assume the contrary. Construct a multigraph H = {Ai,EH), where Eh = {c-x = uv : x £ 
Bf,NGi{x) = {u,v}}. Set Yi = F and for each edge Cx = uv £ Eh., set as We^ the set of all 
vertices of Gq at distance at most 2/e from x. Observe that since Gq is of maximum degree 
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at most k, we have \we^\ < ll?''' . For the same reason each vertex of Go appears in at most 
2/0^/"^ sets We^ ■ 

In order to use Corollary 13.81 we need to reduce the graph H^ in a way ensuring all its vertices 
are of degree at least 3. However we know, that the graph H is of average degree at least 
2 + 2e, since |£^j:f |/|Aj| = |i?j^|/|Aj| > 1 + e. Let U' = H. As long as there exist an isolated 
vertex, or a vertex of degree one in H' remove it. Note that such a reduction rule does 
not decrease the average degree of H' . Similarly if H' contains a path vo,vi, . . . ,vi,Vi+i, 
where all vertices Vj for 1 < j < i are of degree exactly 2 and i > 1/e, then remove all the 
vertices Vj for 1 < j < i from H' . As this operation removes £ vertices, but only i + 1 edges, 
and I > 1/e, the average degree does not decrease. Finally, we construct H" from H' by 
simultaneously considering all the maximal paths vq,vi, . . . ,vi, w^+i, with all internal vertices 
of degree two, and contracting each of such paths to a single edge e' = uof^+i and setting 

As H" is of minimum degree at least 3, we apply Corollary l3.8l to it, where 7 = 2A;^/'^(l/e + l). 
Let Ho = {Vo,Eo) and P = {Bi)1^^ be as defined in CorollaryESl Let X C Bf be the set of all 
the vertices of Bf corresponding to the edges of £^o- As \Eq\ > \Vo\ we have \NQ^{X)\ < l-'^]. 
Clearly X is of size at most (9 + 41og3/2 |-7^|)7- It remains to show that we can lift X to an 
improving set of bounded pathwidth, without increasing its size significantly. 

Let Yi = X. For j = i - 1, ... ,0 set Yj = Yj+i U (iVG,(A^G,(>S)) ^ ^j)- Observe that at 
each step the size of Yj increases by a factor of at most k + 1, hence IYqI < |1^|(A; + 1)* and 
moreover Yq is an improving set w.r.t. J-q. Since Yq is of size logarithmic in | J^| it remains to 
show that Ngjt [Yq] is of constant pathwidth. 

Create a sequence of subsets P' = {Bl)1^^, by taking as B'^ the set {\Je=uveEo u veBi'^^e ^ 
^Gjr [Yq])- The size of each B'^ is at most {w + 1)^7, where w is the width of P, hence it 
remains to show that P' is indeed a path decomposition. Each vertex of Nqjt [Yq] is within 
distance at most 2/e from some vertex of X, hence since P is a path decomposition we have 
|J^<j< i?j' = Ngj- [Yq] and each edge of Nqj^ [Yq] has its both endpoints in some bag B'^. 
Finally the last two properties of Corollary 13.81 ensure that each vertex of Nqj, [Yq] appears 
in a set of bags i?,' forming an interval in the sequence P'. Therefore Yq is an improving 
set of logarithmic size and of constant pathwidth, which is a contradiction. Consequently 
\Bf\<{l + e)\A\. 

As each vertex of Ai is of degree at most k in Gi, the number of edges of Gi is at most A;|^j|. 
At the same time the number of edges of Gi is at least \Bj\ + 2\Bf\ + 3|i??|, therefore 

\Bl\ + 2\Bf\+3\Bf\<k\Ai\. 

Note that summing the inequalities: 

\B}\<€\Ai\ 
\Bl\<e\Ai\ 
\Bf\<{l + e)\A,\ 
\B}\ + 2\Bf\ + 3\Bf\ <k\Ai\ 
we obtain 

\Bi\ <((A; + l)/3 + e)|^i|. 



However \OPT\B,\ = \To\Ai\, hence |OPr| < ((/c + l)/3 + e)|J■o|• 
In the second case we have proved the thesis, while the first case can appear only 1/e number of 
times, as in each step we remove at least e\OPT\ vertices from Bi. Therefore the lemma follows. D 

Lemma 13.91 together with the algorithm for searching improving sets of bounded pathwidth 
from Theorem 13. 51 gives a polynomial time (A; + 1 + e)/3- approximation algorithm for /c-Set PACK- 
ING, which in particular is a (4/3 + e)-approximation for 3-DiMENSiONAL Matching and proves 
Theorem II. 2[ 

4 Local search hardness 

In this section we are going to show, that there is no algorithm verifying for a given J^q C J^, 
whether there exists an improving set (see Definition 13. 2p of size at most r in /(r)poly(|J^|) time, 
even when k = 3. In fact we show a stronger hardness result, ruling out existence of an algorithm, 
that either finds a bigger disjoint family Ti (without any restriction on its distance from J^q), or 
verifies that there is no improving set of size at most r. That is exactly the notion of permissive 
parameterized local search introduced by Marx and Schlotter in [2T] (for more information about 
parameterized local search see [20]). 

In our reduction, we use a standard W[l]-hard problem \T0\, namely Multicolored Clique 
parameterized by the clique size. 



Multicolored Clique 

Input: An undirected graph G = {V,E), a positive integer k, and a color function c : V ^ 

{0,...,k-l}. 

Question: Does the graph G contain a clique of size k, where each vertex is of different color? 



Theorem 4.1. There is a constant a > 0, such that given an instance I = {G,k,c) of Multi- 
colored Clique one can in polynomial time construct an instance J-" C 2 of 3- Set Packing, 
together with a disjoint subfamily Ti^ ^ T of size |C/|/3 — 1, such that: 

• If I is a YES-instance, then there exists a family J^i Q T of disjoint \U\/3 sets, such that 
1-^0 \ -T^il + 1-^1 \ -^ol < afc^, 

• if there exists a disjoint subfamily J-iQJ- of size \U\/3, then I is a YES-instance. 

Proof. We start with a definition of a simple gadget, that will be used a couple of times in the 
construction. 

Definition 4.2. For a positive integer h > 1 and a symbol x an {x, /i)-amplifier is a family J^x ^ 2^^^ 
of sets of size 3, where 

Ux = {xi,...,2;2.4h_i}, and 

^x = {{xi,X2i,X2i+i} : 1 < i < 4^^} 

Let I = {G = {V,E),k,c) be an instance of Multicolored Clique. W.l.o.g. we may assume 
that k = 4 , where /i is a positive integer, since otherwise we may add universal vertices (adjacent 
to all other vertices). We start with constructing an (x, /i)-amplifier, which will be called the top 
amplifier, and (v, /i)-amplifier for each v &V, called vertex amplifiers. As the universe U we take 

U = UxU{[J Uy)U {v[,v'l -.veVjU {s(ij) : < i < j < A;} U {^i : 1 < i < 2/;;} . 
To the family T we add all the sets of Tx and Ty for v € V, as well as: 



(i) sets {vi,v'i,Vi} for v €V, 

(ii) sets {xk+i,v'i,Vi} (or < i < k for v £ c^^{i), 

(iii) sets {uk+c{v),Vk+c{u),S(c{u),c{v))} for uv G E, c{u) < c{v), 

(iv) sets {vk+c{v)J2c{vyiJ2c{v)} for V eV, 

(v) sets {isi_2,hi-i,hi} for 1 < i < [2k/3\ (note that 2fc = 2 • 4^ = 2 (mod 3)), 

(vi) consider all the elements saj\ in lexicographic order of pairs {i,j), take subsequent triples of 
elements and add them to the family J-", that is add sets 

{■5(0,1)' ^{0,2)5 S(o,3)}' • • • ' {S{k-3,k-2)i S{fc-3,fc-l)' ■5(fe-2,fc-l)} 

(note that (2) = (mod 3), since {k - I) = (mod 3)). 
To finish the construction we create a disjoint family J-q of size \U\/3 — 1 as follows: 

• add to J-Q sets {xi,X2i,X2i+i} G /"a; for 1 < i < /c such that [log2 ij is odd. 

• add to J"o sets {vi,V2i,V2i+i} G J^v for v (^V and 1 <i < k, such that Llog2iJ is odd. 

• add to J-Q all the sets from points (i), (v), (vi) of the construction of J^. 

Note that the size of J-q equals \U\/3 — 1, as the only elements which are not covered by /q are 
xi, i-2k~i and l2k- 

Claim 4.3. If I is a YES-instance, then there exists a disjoint family J^i C J^ of size \U\/3, such 
that \J='i \ J-QJ + [J-Q \ jr^l = 0{k'^). 

Proof. Let i^ C y be a solution to /, that is a multicolored clique of size k. Construct a disjoint 
family Ti as follows: 

(a) add to J-'i sets {xi,X2i,X2i+i} G J-x for each 1 <i < k, such that [log2 ij is even, 

(b) add to J-'i sets {vi,V2i,V2i+i} G J^x for v ^K and 1 < i < /c, such that [log2 «J is even, 

(c) add to J-'i sets {vi,V2i,V2i+i} G /'a; for w G F \ K and 1 <i <k, such that [log2 i\ is odd, 

(d) for < i < k add to J-'i the set {xk+i, v[,Vi}, where v is the unique vertex of K of color i, 

(e) add to J-'i sets {vi, v[, v'{} for v gV \K, 

(f) add to J'l sets {uk+^(^y^),Vk+c{v),Sc{u),c{v)} for u,v e K, c{u) < c{v), 

(g) add to J'l sets {t'fc+c{^),^2c(i,)-i,^2c(i,)} for v e K. 

A direct check that the above family is disjoint and covers all the elements of U, hence [J-'i | = \U\/3. 
Note that in the above construction of J^i in each of the points (a), (d), (g) we add to J-'i only 
0{k) sets, while in points (b), (f) we add to Ti 0{k'^) sets, whereas in points (c) and (e) we add 
to J-'i sets that are present in J-'q. Therefore the number of sets of J-'i which are not present in J^q 
is upper bounded by a linear function in /c^. D 

Claim 4.4. If there exists a disjoint family T\ of size |f/|/3, then I is a YES-instance. 
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Proof. Let Ti C J^ be any disjoint family of size |f/|/3. Since the element xi can be covered only 
by the set {xi,X2,X3}, the family Ti contains all the sets {xi,X2i,X2i+i} G J^r for 1 < i < A:, 
where [log2 i\ is even, and consequently elements Xk+i for < i < /c are not covered by sets of J^x- 
Therefore elements Xk+i are covered by sets from point (ii) of the construction of J^, hence for each 
< i < k m Ti there is exactly one set {vi,V2,vs} G Ti for v € c~^(i), and let K be the set of 
those k multicolored vertices. 

We want to show that K is a clique. As for each v € K we have {vi,V2,V3} S J^i, the family 
Ti contains all the sets {vi,V2i,V2i+i} ioi 1 < i < k where [log2 ij is even. Consequently elements 
Vk+i for < i < k, i ^ c{v) are covered by sets from point (iii) of the construction of J-". Consider 
any pair < i < j < k. Denote as u the unique vertex of i^ n c^^(i) and let {uk+j,Vk+i, S(ij)} be 
the set of Ti covering Uk+j, where v € c~^{j). This implies that Vk+i is not covered by a set of the 
(u, /i)-amplifier, hence vi is covered by the (t), /i)-amplifier, i.e. by {vi,V2,V3}. Therefore v G K 
and the vertices of colors i and j of K are adjacent. Since i and j were selected arbitrarily, K is a 
clique. D 

The proof of Theorem 14.11 follows from Claim 14.31 and Claim 14. 4[ D 



Theorem |4.H together with the well-known fact that Multicolored Clique is W[l]-hard [TO] 
implies Theorem ll.il 

5 Future work and open problems 

One can try to continue the research direction of Chan and Lau [6] , who presented a strengthening 
of the standard LP relaxation, proving integrality gap of {k + l)/2 using a local search inspired 
analysis. We would like to ask a question whether it is possible to obtain some strengthened LP 
relaxation with integrality gap {k + c)/3-for some constant c. 

Finally, we believe that it is worth looking into other problems, where local search algorithms 
were apphed successfully, such as /c-Median [3J or restricted Restricted Max-Min Fair Allo- 
cation [24j . A potential goal would be to design improved approximation local search algorithms 
using non-constant size swaps in the spirit of the framework of this paper. 
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A Proof of Lemina 13.41 

Let us recall, that we use two random colorings cjr^ : Tq ^- [r — 1], cu : U ^ [rk], where cu ensures 
that the sets of X are disjoint, while cjr^ is used not to consider the same set of J-q twice. 

Lemma A.l (Lemma 13.41 restated). There is an algorithm, that given a disjoint family Tq C T, 
and two coloring functions cj^ : J-q ^- [r — 1], cjy : [/ ^ [rA;] in 2'-'(''''^)jJ^|'-'(P") time determines, 
whether there exists an improving set X C J^\J^q of size at most r of pathwidth at most pw, such 
that cj^g is infective on Ngj, (X) and cij is injective on Usex '^• 

Proof. Define an auxiliary directed graph D = {Vd, A forget U Aintroduce), where each vertex is 
characterized by a subset of set colors [r — 1], a subset of element colors [rk], an integer < j < r, 
and a subset of J- of size at most pw + 1, i.e. 

Vh = {v{Cr„Cu,j, B) : Cr, C [r - 1], C^ C [rfc], <3<r,B^T, \B\ < pw + 1} . 

Note that this graph has 0{2^^^^^' r\T\^'^^^) vertices. The integer j is used to store the size of an 
improving set under construction, while B ensures the bounded pathwidth property. 

To the set Aintroduce wc add the following arcs. For s = v{Cjrg,Ci/,j, B) G Vh-, S G J^ such that 
\B\ <pw: 

• if 5 G T\To, cu{S)r\Cu = 0, j < r, cj^ is injective on iYc^^^ (S) and ct^^(Ng:p^^ {S)\B)nCjr^ = 
0, then add to Aintroduce an arc {s,v{Cjr^,Cu UcuiS),j + 1,BU {S})) 

• if S G Joi cj-ji(S') CjTji and for each S' G B \ Tq either S G Nc-p (5"), or cjr^^,{S) 
cj^oi^GToi^'))^ then add to Aintroduce an arc {s,v{Cjrg U {cjrQ{S)},Cu,j,BU {S})) 

To the set A forget we add the following arcs. For s = v{Cjr^^,Cir,j, B) G Vh, S £ B add to 
A forget an arc (s, viCj^^ , Ci/,i, B \ {S})) if one of the following conditions holds: 

• S^To and c^„(iVG^^^(S)) C C^^. 

Claim A. 2. There exists a path in the graph D from the vertex f (0, 0, 0, 0) to a vertex v{CjrQ , Cu,j, 0) G 
Vd for {Cjtq \ < j iff there exists an improving set X of size at most r of pathwidth at most pw, such 
that cjr^ is injective on Nqj, {X) and cij is injective on IJsgx '^• 

Proof. Assume that there is a path si,. . . ,Sq in H, where Si = {Cjr ,Clj,ji,Bi), si = (0,0,0,0) , 
\Cjr I < jq and Bq = 0. Let X = Ui<i<q-^i \ -^o- By constriction of D, we have \X\ = jq < r. 
By the definition of Aintroduce and Aforget since Bq = 0, at the time a vertex v ^ X appears for 
the first time in some Bi we ensure that all its neighbors in Gj^^^ are either in Bi or are colored 
by cj^o with colors not yet in Cj: . Moreover at the time f G X is forgotten, i.e. removed from 
some Bi, we ensure that all of its neighbors in Gj^^ have been already added to bags. Therefore 
Ngj, [X] C |Ji<i<g-^« and for each edge e of G[Ngj, [X\\ its endpoints appear in some bag Bi. 
Since no set of Fq is added twice, due to the coloring cj-^j , no set oi T \Fo is added twice, due 
to the coloring cu, {Bi n Nqj, \X])1^^ is a path decomposition of Nqj, [X] of width at most pw. 
Finally, the counters ji ensure that \Ngj: {X)\ < \X\, since \Ngj: iX)\ < \Gjr \ < jq = \X\. Hence 
X is an improving set of size jq < r and of pathwidth at most pw. 

In the other direction, let X be an improving set of size at most r such that cj-^^ is injective on 
Ngjt (X), cu is injective on IJsgx ^^ ^^'^ ^^^ ^ ~ (-Si)i=i be a nice path decomposition of Ncjr [X] of 
width at most pw. For I < i < q define Si G Vd as Sj = f (cj-Q(i?j'nJ-o), cu{\JseB'\To '^)' l-^i\-^o|) Bi), 
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where B[ = Ukjxj-Sj. Observe that si = (0,0,0,0), s, = (Cj-,,, C^/, [X], 0) for |C^J < |X| and 
moreover if .Bj+i is an introduce bag, then (sj, Sj+i) G ^inirodnce while if -Bj+i is a forget bag, then 
(sj, Si+i) G A forget- Consequently there is a path from si to Sg in the graph D. 

U 

By the above claim it is enough to run a standard graph search algorithm, to check whether 
there exists a path from the vertex z;(0, 0,0,0) to f (Cj-^, C[/, j, 0) where |Cj-jj| < j, which finishes 
the proof of Lemma 13.41 As a side note we would like to add that if all the sets of J- are of size 
exactly k, then the integer j is not needed in a state description, as the number of sets oi J-\J-q 
added so far equals \Cu\/k. D 
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